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Abstract —Motivated by recent high bandwidth communica¬ 
tion systems, Inter-Symbol Interference (ISI) channels with 1- 
bit quantized output are considered under an average-power- 
constrained continuous input. While the exact capacity is difficult 
to characterize, an approximation that matches with the exact 
channel output up to a probability of error is provided. The 
approximation does not have additive noise, but constrains the 
channel output (without noise) to be above a threshold in absolute 
value. The capacity under the approximation is computed us¬ 
ing methods involving standard Gibbs distributions. Markovian 
achievable schemes approaching the approximate capacity are 
provided. The methods used over the approximate ISI channel 
result in ideas for practical coding schemes for ISI channels with 
1-bit output quantization. 

I. Introduction 

Channels with Inter-Symbol Interference (ISI) and Additive 
White Gaussian Noise (AWGN) are often encountered in prac¬ 
tice. Depending on the application, an average-input power 
constraint or a finite input alphabet constraint is commonly 
studied. Recently, in applications such as millimeter wave 
[ 1] [2] or optical or intra-chip [3] communications, quantization 
of the output of an ISI channel has been considered because of 
limitations in Analog-to-Digital conversion at high speeds. In 
some cases, the output quantization may be as low as a single 
bit. Since the transmitters in some of these systems can be 
more complex and operate at high powers, the channel input 
may not have severe quantization limits. 

Motivated by the above applications, we consider a noisy 
ISI channel with average-power constrained continuous input 
and 1-bit quantized output. The available literature mostly 
considers either continuous input/output alphabets or a finite 
input alphabet with a continuous output alphabet [4] [5]. The 
quantized output case has been considered for the AWGN 
channel with no ISI [6], and the ISI case has been briefly 
addressed recently [7]. 

The exact capacity of an ISI channel with 1-bit quantized 
output appears to be difficult to characterize explicitly. In 
this work, we introduce an approximation to the ISI channel 
model with 1-bit quantized output. The approximation does 
not have additive noise, but constrains the channel output 
(without noise) to be above a threshold in absolute value. 
Because of the thresholding, the approximate channel output, 
after quantization, matches the actual channel output up to a 
probability of error that can be controlled by the threshold. 


The main advantage of avoiding noise in the approximation 
is that the exact capacity can be computed for the approximate 
model. We show how such a computation can be carried out 
using Gibbs distributions. In addition, we exhibit achievable 
schemes with Markov input approaching the approximate 
capacity in some numerical examples. Since the approximation 
is valid up to a probability of error, a coding scheme used over 
the approximate channel model can be coupled with a standard 
error control code to derive a practical coding scheme for the 
exact channel. 

The rest of the paper is organized as follows. Section II 
describes the ISI channel model and its approximation. In 
Section III, we provide a method for computing the approxi¬ 
mate capacity under an average power constraint on the input, 
and elaborate on achievable schemes in Section IV. Numerical 
results are given in Section V, and concluding remarks are 
made in Section VI. 


II. System Model 

We consider a discrete-time finite-tap ISI channel with 
average-power-constrained continuous input and one-bit quan¬ 
tized output as depicted in Fig. 1. The input to the channel is 
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Fig. 1: ISI channel with quantized output. 


denoted X = {X n , 0 < n < N — 1}, and the channel impulse 
response of length L is denoted h = {h n , 0 < n < L— 1}. The 
convolution of the input with the channel impulse response is 
denoted Y = {Y n , 0 < n < N — 1}, and is given by 

L—l 

Y n = J2 h kXn-k. (1) 

fc =0 

The channel h is assumed to be constant, and all signals are 
assumed to be zero outside the specified ranges. Independent 
and identically distributed zero-mean Gaussian noise of vari¬ 
ance er 2 , denoted Z n , is added to obtain an intermediate signal 
Rn = Yi + Z n . The signal R n is quantized by a 1-bit quantizer 








Q(-) to obtain the channel output Q(R) = { Q(R n ), 0 < n < 
N + L — 2}. The quantizer is defined as follows: 


Q(x ) 


+1, if x > 0, 
— 1, if x < 0. 


( 2 ) 


The average power of the input is constrained to be at most 
P. That is, we require 

N -1 

E[||X|| 2 ] = ]T E[\x n \ 2 } < NP. (3) 

n —0 


III. Approximate Capacity 


The capacity of the approximate ISI channel is given by 


Cs(P) 


lim sup 

E[||X|| 2 ]<JVP 

|V»I>« 


N 


H(S ) 

N ’ 


where the last equality is because the sequence S' is a de¬ 
terministic function of the input X in the absence of noise 
in the approximate channel. Since the output alphabet of 
the quantizer is either 1 or —1, it is easy to observe that 

C S (P) < 1. 


In this work, the overall goal is to approximate the mutual 
information rate jjI(X;Q(R)) and provide computable ex¬ 
pressions or bounds. 

The approximate ISI channel model is depicted in Fig. 2. 
In the approximate ISI channel, there is no noise. However, 



Fig. 2: Approximate ISI channel with quantized output. 

the convolution output Y n is constrained to be greater than 5 
in absolute value, and this provides justification for ignoring 
noise. We readily see that, under the constraint \Y n \ > 6, 
the output of the actual model Q(R n ) is approximated by 
S n = Q(Y n ) U P to a probability of error lesser than or 
equal to 0.(5/ cr), where Q(x) = j/° -^=e _ “ ^ 2 du is the 
standard Q-function. Therefore, coding schemes developed 
for the approximate model can be used in the actual model 
with additional error control coding for the approximation 
error 0(6/a). While the additional coding results in a loss 
in information rate, we find that the approximation, because 
of the removal of noise, is useful on the following two counts: 

1) The capacity or maximum information rate jjI(X;S), 
where S = {S' rt ,0 < n < N — 1}, under the 
power constraint of (3) and the constraint \Y n \ > 6, 
has computable expressions and bounds. This provides 
useful estimates of the capacity of output-quantized ISI 
channels. 

2) The techniques used for computing the approximate 
capacity provide useful intuition on coding and signaling 
methods for output-quantized ISI channels. 

Convolution : We will use matrix notation to denote the con¬ 
volution in (1) as Y = MhX, where the entries of the N x N 
matrix Mh are either 0 or one of the channel taps hk and 
X, Y are column vectors. The matrix Mh is not circulant. 
Let Mh denote the N x N circulant matrix with first column 
equal to h. When N becomes large (as is in our case), with L 
fixed, Mh behaves like the circulant matrix A//, in the sense 
that limN^oo \\Mh~ Mh\\ = 0 where ||.|j is a matrix norm. In 
this paper 1 , for simplicity, we will always assume the circular 
convolution Y = MhX in the channel model. 

1 [ or all N . the results hold with a suitable cyclic prefix, for instance. 


A. Power constraint 


We begin by bounding the power of the input sequence 
X required for a given output sequence S. Given the output 
symbol sequence S = s € {—1,1}^, we have the constraint 
that \Y n \ = s n Y n > 5. So, the minimum energy, denoted 
£(s), required for a given quantized sequence s is given by 
the following optimization problem: 


£( s ) 


N—l 

min V' \x n \ 2 , 

diag(s) Mh,xy <51 ^ J 


(4) 


where diag(s) is an N x N diagonal matrix with s on the 
main diagonal and 1 denotes the all-1 column. The inequalities 
diag ( s)MhX A <51 are linear and the feasible space for x is 
the intersection of hyperplanes and, hence, convex. So, the 
above optimization problem is a convex optimization problem 
in N variables that essentially finds the closest point from the 
origin to the convex set { dia , g ( s)MhX A <51}. We have 


E[||X|| 2 ] = £ P(S = s)E[||X|| 2 | 5 = s], 

> Y, ns = s)£( S ) 

s£{- 1 , 1 } n 


using (4). Because of the average power constraint on the input 
X, we have 


p (>5 = s )£( s ) < NP. (5) 

«e{-i,i} iv 

B. Entropy maximization and Gibbs distribution 

Let £min = min s £ ( s ). f max — max s £(s), £ — 
2 iv^ s £(s). Since the constraints (5) are linear on the 
probabilities, it is well-known [8] that the Gibbs distribution 
maximizes the entropy H(S) for £,„in < NP < £ max . The 
optimal distribution is the Gibbs distribution given by 

j£lsl 

F(S = s) = ^^,s&{- 1,1} n , ( 6 ) 

where Z is the normalizing constant, and ti is the unique value 
for which (5) is met with equality. The maximum entropy is 
given by 


H(S) =pP + In (Z) 
= j3P — In 


(7) 


E e 

e{-i,i} N 




( 8 ) 





For NP < £ m ; n , there exists no probability distribution 
that satisfies (5). It is also known [8] [9] that /3 = 0 when 
NP = £ and we observe that the corresponding Gibbs 
distribution is the uniform distribution on {—1, 1 } N and the 
maximum H(S) = N. For P = £ m i n , we have /3 = oo and 
the maximum entropy is given by log 2 (|<S m |), where |<S m | 
denotes the number of sequences in {—1, l} Ar that achieve 
the minimum energy. Hence, the maximum entropy when 
P = £ min is H(S) = log2( ]f ml) . 

In summary, we see that the capacity of the approximate ISI 
channel is given by the Gibbs distribution whenever the power 
constraint is above £ m i n /N. For P > £/N, we can achieve 
the maximum possible capacity Cg{P) = 1. For powers lower 
than £ m in /N, capacity goes to zero. So, the interesting range 
of calculation if for NP £ (£ m ; n , £). 


We want A* ^ 0 for every s. We can easily observe that this 
is indeed true if is diagonally-dominant. ■ 

Hence, from Lemma 1, when is diagonally- 

dominant, x* = 6Mr 1 s. Hence, 

£(s) = \\x*\\ 2 = 6 2 s t Gs, (12) 

where G = 

In the next lemma, the mean energy £ is characterized 
in terms of the channel matrix A//, for diagonally-dominant 
channels. 

Lemma 2. The mean energy for diagonally-dominant chan¬ 
nels is given by 

£ = 6 2 tr(Mf T Mf 1 ). 


C. Diagonally-dominant channels 

A matrix A = (a t j) is said to be row-diagonally-dominant 
or simply diagonally-dominant if \au\ > Yhj j^i \ a ij I- Let 
us call channels h for which the matrix exists 

and is diagonally-dominant as diagonally-dominant channels. 
For such channels, the minimum energy values £(s) can be 
characterized as follows. 

Lemma 1. When the matrix is row-diagonally- 

dominant, £(s) for s £ {— 1,1} Ar is achieved at x* that 
satisfies the equality constraints 

diag (s)MhX* = 51. (9) 

Proof: Since the constraint set is linear, the optimization 
problem (4) is strongly dual. We now solve the optimization 
problem by forming its dual. The Lagrangian is given by 

L{ X,x) — ||a’|| 2 + A T (5l — diag(a:)M^a;). 

The gradient of the Lagrangian with respect to x is given by 
2a; — (diag(a;)M? l ) T A, which gives 

* _ (diag{x)M h ) T X 
1 2 

Substituting a:* in the Lagrangian, the dual problem is given 
by 

max -^'lagMAWt Ji.gWA + ^ (10) 

A^O 4 

“ S -V-' 

aW 

We have 

v 9 (A) = ~ Iliag(< ' |Ml ; Agd i»8(«)^ +n (U) 

Setting the gradient to zero in (11), we obtain the optimal a:* 
as the solution to 

— diag(s)M h x* + 51 = 0, 

which is exactly equivalent to (9). The only caveat is that the 
A obtained from g{ A) = 0 should be in the positive orthant. 
Solving g(A) = 0, we obtain 

A* = 25 diag(s)(M/,M^) -1 diag(s)l. 


Proof: The energy £{s) in (12) can be expanded as 


N 


f(5) = 5 2 ^G, ;i + Y, G i. 


SiSj. 


(13) 


*=i idXAi 


Hence, 


E = 


i 

2 "n 

se{-i,i} N 

E E Gu +E Gi J fp E SiS i 


2 n 


«£{-!,1}" *= 1 


2 n 

se{-i,i} i ' 

=o 


The second term in the above sum is zero since the summation 
spans over all the sequences on {1, —1}^. Hence 


N 


£ = 5 2 ^G ii ( = ) 5 2 tr(M ft - T M ft - 1 ), 


i=l 


where (a) follows from the definition of the matrix G. ■ 
We now characterize Ph = linijv_>. 0 0 £/N, which is the 
minimum average power needed for capacity of 1 bit, in terms 
of the Fourier transform of the channel h. Let the discrete- 
Fourier transform of the channel be 


L—l 


/(A) = E h ^ jkX - 

k—0 


Since Mh is a circulant matrix, it is easy to see that 

f = 5 2 = <5 2 E |f( l )|2 - 

Using standard arguments, it can be shown that [10] 

- 5 2 f 271 1 

Ph "2ih IMF 


(14) 


(15) 


(16) 


Observe that Ph is the energy of the inverse of the channel 
scaled by 5 2 , and is related to the power needed for zero¬ 
forcing. 



In summary, for diagonally-dominant channels, the approx¬ 
imate ISI capacity is given by 

f 1 if P>Ph, 

C S {P) = l (3P + In (Z) if P h <P< P h , 

[o if P < P h , 

where P ft = liiriA r ->oo £min/N (assuming limit exists) and Z 
is the normalizing constant for the Gibbs distribution. 

IV. Achievable Schemes 

We now consider achievable schemes for the approximate 
ISI channel under that assumption that the channel matrix A//, 
is invertible. In achievable schemes, an information sequence 
B £ {— 1,1} Ar with a well-chosen distribution is encoded 
into a channel input x that satisfies \y n \ > 5. The rate of 
transmission over the approximate ISI channel is H(B)/N. 

A. Zero-forcing with Gibbs distribution 

Let b = {hi, 62 , • • •, hjv} € {—1,1}^ be an instance of the 
information sequence B. Choose the input to the channel as 

x = 6Mf 1 b, (17) 

which implies that the output of the ISI channel is y = M^x = 
5b. This is referred to as zero-forcing because it involves 
channel inversion. Hence, the output of the quantizer s equals 
the information sequence b. When the channel is diagonally 
dominant. Lemma 1 and Section III-B imply that a Gibbs 
distribution on B and the choice x = 5Mf 1 s as the input 
to the channel results in a capacity-achieving scheme. Hence, 
for diagonally-dominant h, the scheme in (17) is optimal when 
b is sampled from the Gibbs distribution given in ( 8 ). 

However, & is a sequence of length N , and it is well known 
that sampling from a Gibbs distribution has exponential com¬ 
plexity in N. For achieving capacity, N should be very large 
which makes this scheme impractical. In the next subsection, 
we compute the entropy rate when b is sampled from a Markov 
chain instead of a Gibbs distribution. 


B. Zero-forcing with Markov input 

As before we choose x = 5Mf l b, where b is the informa¬ 
tion sequence. The sequence b is sampled from a two state 
Markov chain shown in Fig. 3 with transition matrix 


P = 


a 1 — a 
1 — a a 


(18) 


where 0 < a < 1. Observe that s n = b n and the achievable 
rate of this scheme equals the entropy H(B) = H 2 (a), where 
!I>(x) is the binary entropy function. The average transmit 
power, denoted P zm (a), is given by 

P zm (a) = ^E[||X|| 2 ] = ^E [B T M^ T M^B] 

= ^tT{Mi T E[BB T ]M^). 

Using the eigenvalue decomposition of P, it can be shown 


1 — a 



1 — a 

Fig. 3: Markov chain with transition matrix given in (18) 


that 


pa _ 



1 —(2a —I) d 
2 

l+(2a-l) d 


Using (0.5, 0.5) as stationary distribution and P d , we get 


E [b n b n +d] = P d , (19) 


where (3 = 2a — 1. Hence the correlation matrix R = E [BB T ] 
is given by R,, j = /3^~ l I. So for finite N 

Pzm(a) = ^tr (RM- 1 M~ t ). 

When N is large, the Topelitz matrix R can be approximated 
by a circulant matrix [10] and 


5 2 f 2 * 1 

iTwF 


( 2(1 — f3 cos(A)) _ \ 

\ 1 + (3 2 — 2/3 cos(A) ) 

( 20 ) 


The value of a is chosen so as to maximize the entropy rate 
and the rate achieved at power P, denoted P m (P), is obtained 
as 


7Z m (P) = max H 2 (a). (21) 

a-.P zm (a)<P 

V. Numerical examples 

In this section, we evaluate the approximate ISI capacity and 
the rate achieved by the Markov scheme in Section IV-B for 
some sample channels. For numerical evaluation, we assume 
5 = 0.3. 

The channel (1, e), |e| < 1 is a diagonally-dominant channel 
and zero forcing with Gibbs distribution is an optimal strategy. 
For this channel, 

/(A) = l + ee^' A . 

Using (16), the minimum power required for zero-forcing is 
given by 

^ r 2 ” 1 s* 

h 2ir J 0 1 + e 2 + 2e cos(A) 1 — e 2 

While difficult to prove theoretically, by careful simulations, 
it can be observed that the minimum energy £ m ; n is obtained 
for the sequences ±(1,1,...,1). Using this observation, P_ h = 
limjv-KX) £min/N is obtained as 

p - 

- fc (l+£) 2 ' 





1 




Fig. 4: lZ m (P) and C(P) versus normalised energy P/S 2 for Fig. 5: lZ m (P) and C(P) versus normalised energy P/S 2 for 
{1, e} channel {—e, 1, 2e} channel with e = 0.3 


The transmit power required for the two state Markov scheme 
(20) is 


^m(^) — 


s 2 


1 — e 2 


1 + e(l - 2a) 


1 - e(l - 2a) 


Hence the maximum entropy problem for the two state Markov 
chain translates to 


is complete for the case of diagonally-dominant channels. 
Extensions to more general channels and better achievable 
schemes are interesting problems for future study. Another 
important problem is bounding the error in the information 
rate because of the approximation, which is complicated by 
the dependencies introduced by the Gibbs distribution. 


such that 


P-m(P) = ma xH 2 (a), 

11 1 — PS~ 2 {\ — e 2 ) 
a> 2 + Tel + PS- 2 {l-e 2 )' 


The solution of the above problem is given by 


n m (P) 



1 l-PS- 2 (l-e 2 ) \ 
2e l+P<5- 2 (l-e 2 ) J 


if P > Ph 


if 


s 2 

IT+^F 


<P<P h 


if P < 


s 2 

IFhF' 


We first observe that the capacity Cg(P) and the achievable 
rate lZ m (P) match at P_ h and P h being equal to 0 and 
1, respectively. In Fig. 4, the approximate capacity and the 
achievable rate of the Markov scheme are plotted as a function 
of normalized power P/S 2 for e = 0.2 and 0.8. We observe 
that 7 Z m (P) is very close to capacity for e = 0.2 and the gap 
increases with e. 

In Fig. 5, lZ m (P) and Cs(P) are plotted for a non- 
diagonally dominant channel h = (—0.3,1, 0.6). The energies 
£(s) are obtained by numerically solving (4). For this channel 
P h /S 2 « 0.838 and PJ6 2 ps 0.56 and TZ m (P) = 0 for 
P/S 2 < 0.59. We observe that at higher powers the Markov 
scheme is close to capacity even for this non-diagonally 
dominant example. 


VI. Concluding Remarks 
The capacity of the approximate output-quanitzed ISI chan¬ 
nel is characterized using Gibbs distribution, and Markov 
schemes are shown to approach capacity. The characterization 
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